The goal of causal mediation is, broadly, to decompose the effect of a treatment on an outcome (total effect, TE) into the direct effect (DE) caused by treatment and the indirect effect of treatment by way of some mediators. That is, if we have \(p\) mediators: \[ \text{TE} = \text{DE} + \text{IDE}_1 +\ldots+ \text{IDE}_p = \text{DE} + \text{TIDE} \]
Historically, such models have been fairly simple: one treatment, one mediator, one outcome, some set of confounders. There have been a few papers that allow for multiple outcomes: Schaid et al. (2022) and Zhao et al. (2023). However, both of these methods assume that the treatment, mediators, and outcomes are jointly normal. This precludes the use of a binary treatment.
The proposed model does not make any joint distributional assumptions and allows for binary treatments.
In most of the below models, the mediators are taken to be the taxa in the respective biome, aggregated to the genus level, and transformed with via CLR.
There are several important limitations to note, which are being addressed in current and future improvements:
The outcomes are assumed to be normally distributed
The mediators and outcomes are assumed to be cross-sectional
The effect estimates are biased due to use of a LASSO penalty in the second step of model fitting
The general form of the considered mediation models is seen in Figure 1. The set of mediators consists of some subset of saliva microbiome, plaque microbiome, and oral health outcomes (PI, ICDAS, DS, MS, FS, DT, MT, and FT). The set of outcomes consists of some subset of oral health outcomes (surfaces only) and birth outcomes (Adverse birth outcome and birth weight).
General mediation model
The microbiome data were first grouped at the genus level before being transformed to relative abundance within individual and transformed with the centered log-ratio transformation to better approximate normality.
Plaque index (PI) and the international caries detection and assessment system (ICDAS) are presented on their original scale as their distributions were roughly symmetric. Each of decayed, missing, and filled surfaces (DS, MS, and FS, respectively), as true count data, were transformed with a square root before standardization so as to render their distributions more continuous and/or approximately normal. For example: \[ \text{MS}'=\frac{\sqrt{\text{MS}_i}-n^{-1}\sum_{i=1}^n \sqrt{\text{MS}_i}}{\text{sd}(\sqrt{\text{MS}})} \]
Interpretations, therefore, need to be made in terms of changes in standard deviations of the square root of the considered variable relative to the mean of the variable on the square root scale. Adverse birth outcomes is a binary variable and was thus left as-is albeit in violation of our normality assumption. Birth weight was sufficiently symmetric and was therefore left on its orignial scale.
The set of considered confounders: age, race, Hispanic (Y/N), yeast infection (Y/N), on antiobiotics (Y/N), on antifungal (Y/N), brushing twice daily (Y/N), prenatal inhaler use, prenatal diabetes status, prenatal asthma status, prenatal emotional condition, prenatal hypertension status, prenatal smoking status, employment status, modified education level (middle school/high school, Associates’ degree, or Bachelor’s degree), marriage status, number of children, cortisol level (log), estradiol level (log), progesterone level (log), testosterone level (log), T3 level (log), T4 level (log), and gestational age at first visit.
Included below are tables corresponding to each of the nine considered models. Each table contains the selected partial indirect effects (PIDEs), total indirect effect (TIDE), and direct effect (DE) of exposure (C. albicans is present). Those in the “Estimand” column that are not explicitly labeled with an effect are assumed to be the partial indirect effect from exposure to the corresponding outcome, through that mediator. Additionally, each table contains the mean and standard deviation of 5000 bootstrap samples for each of the above effects, as well as a bootstrap \(p\)-value (where the null hypothesis is an effect of 0) and two forms of bootstrap confidence intervals.
Proceeding each table is a visualization of the (partially) selected directed acyclic graph (DAG). Note that, for several of the considered models, these DAGs do not contain all selected mediators. Rather, due to limitations of interpretability, they include only those mediators that have at least one statistically significant effect. Double arrows from a mediator to an outcome represent mediation paths (i.e. \(\text{exposure}\to\text{mediator}\to\text{outcome}\)) that are statistically significant at the \(\alpha=0.05\) level based on the bootstrap inference without adjusting for multiplicity. After the DAG comes a short summary list containing all the estimated effects for each outcome. The only information contained here that is not also present in the searchable tables are the TEs, which are simply the summation of the TIDE and DE for that given outcome.
Example conclusions can be framed as follows:
Example conclusion for a (significant) PIDE: Bacterium X has a positive/negative (equiv. increasing/decreasing) mediating effect on the amount/number/value of [(sqrt)-outcome] relative to its mean [in terms of SDs]
Example conclusion for a TIDE: The selected mediation profile has an overall positive/negative (equiv. increasing/decreasing) mediating effect on the amount/number/value of [(sqrt)-outcome] relative to its mean [in terms of SDs]
Example conclusion for a (significant) DE: Ca being present has a positive/negative (equiv. increasing/decreasing) effect on the amount/number/value of [(sqrt)-outcome] relative to its mean [in terms of SDs]
Example conclusion for a TE: Ca being present, in combination with the mediation profile, has an overall positive/negative (equiv. increasing/decreasing) effect on the amount/number/value of the [(sqrt)-outcome] relative to its mean [in terms of SDs]
Model 1 DAG
The effect estimates for each outcome in model 1 are as follows:
PI: TIDE = 0.124229, DE = 0.2897, TE = 0.4140
ICDAS: TIDE = 0.0099, DE = 0.2630, TE = 0.2729
DS: TIDE = 0.0117, DE = 0.8351, TE = 0.8467
MS: TIDE = -0.6689, DE = 0.8351, TE = 0.1662
FS: TIDE = -0.0356, DE = -0.1985, TE = -0.2341
Model 2 DAG
The effect estimates for each outcome in model 2 are as follows:
PI: TIDE = 0.015, DE = 0.4131, TE = 0.4281
ICDAS: TIDE = 0.0002, DE = 0.4844, TE = 0.4846
DS: TIDE = 0.0325, DE = 0.7023, TE = 0.7348
MS: TIDE = -0.0076, DE = 0.6276, TE = 0.6200
FS: TIDE = 0.0253, DE = 0.2881, TE = 0.3134
Model 3 DAG
The effect estimates for each outcome in model 3 are as follows:
ABO: TIDE = 0.0031, DE = 0.0016, TE = 0.0047
BW: TIDE = 0.0705, DE = -0.0407, TE = 0.0299
Model 4 DAG
The effect estimates for each outcome in model 4 are as follows:
ABO: TIDE = -0.003, DE = 0.0200, TE = 0.0197
BW: TIDE = -0.1357, DE = 0.1392, TE = 0.0035
Model 5 DAG
The effect estimates for each outcome in model 5 are as follows:
ABO: TIDE = 0.0000, DE = 0.0071, TE = 0.0071
BW: TIDE = -0.0605, DE = -0.2666, TE = -0.3271
Model 6 DAG
The effect estimates for each outcome in model 6 are as follows:
PI: TIDE = 0.2198 DE = 0.2198, TE = 0.4902
ICDAS: TIDE = 0.0633, DE = 0.2604, TE = 0.3237
DS: TIDE = -0.0136, DE = 0.8397, TE = 0.8261
MS: TIDE = -0.0079, DE = 0.1364, TE = 0.1285
FS: TIDE = -0.0083, DE = -0.1977, TE = -0.2060
ABO: TIDE = 0.0003, DE = 0.0273, TE = 0.0276
BW: TIDE = -0.0771, DE = 0.0799, TE = 0.0028
Model 7 DAG
The effect estimates for each outcome in model 7 are as follows:
PI: TIDE = 0.0138, DE = 0.4162, TE = 0.4300
ICDAS: TIDE = 0.0036, DE = 0.4765, TE = 0.4801
DS: TIDE = 0.0314, DE = 0.7054, TE = 0.7368
MS: TIDE = -0.0072, DE = 0.6285, TE = 0.6213
FS: TIDE = 0.0246, DE = 0.2879, TE = 0.3125
ABO: TIDE = 0.0000, DE = 0.0038, TE = 0.0038
BW: TIDE = -0.0107, DE = -0.2718, TE = -0.2825
Model 8 DAG
The effect estimates for each outcome in model 8 are as follows:
PI: TIDE = 0.1019, DE = 0.3684, TE = 0.4703
ICDAS: TIDE = 0.0398, DE = 0.3083, TE = 0.3481
DS: TIDE = 0.0300, DE = 0.8411, TE = 0.8711
MS: TIDE = 0.0335, DE = 0.1686, TE = 0.2021
FS: TIDE = -0.0079, DE = -0.1917, TE = -0.1996
ABO: TIDE = 0.0267, DE = 0.0267, TE = 0.0267
BW: TIDE = -0.0046, DE = 0.0749, TE = 0.0704
Model 9 DAG
The effect estimates for each outcome in model 9 are as follows:
PI: TIDE = 0.040, DE = 0.4461, TE = 0.4501
ICDAS: TIDE = -0.0080, DE = 0.4289, TE = 0.4209
DS: TIDE = 0.0005, DE = 0.7016, TE = 0.7021
MS: TIDE = -0.007, DE = 0.6225, TE = 0.6155
FS: TIDE = 0.0069, DE = 0.3136, TE = 0.3205
ABO: TIDE = -0.0001, DE = 0.0039, TE = 0.0038
BW: TIDE = 0.0134, DE = -0.2720, TE = -0.2586